LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.orgยท4h
๐Ÿ’ปLocal LLMs
A Kevin week
blog.mitrichev.chยท11hยท
๐Ÿ“Linear Algebra
Topological Sort: Managing Mutable Structures in Haskell
mmhaskell.comยท18m
๐Ÿ”—Topological Sorting
Python Multiprocessing: Start Methods, Pools, and Communication
dev.toยท3hยท
Discuss: DEV
๐ŸŒŠStream Processing
LLM Rerankers for RAG: A Practical Guide
fin.aiยท11hยท
Discuss: Hacker News
๐Ÿ”Information Retrieval
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.comยท4hยท
Discuss: Hacker News
๐ŸงฎKolmogorov Complexity
On training binary neural networks
kevinmartinjose.comยท13h
๐Ÿ“ŠQuantization
A Dumb Introduction to z3. Exploring the world of constraint solvers with very simple examples.
asibahi.github.ioยท11hยท
๐ŸงฎZ3 Solver
Planarizing matchings
11011110.github.ioยท14h
๐ŸŽจGraph Coloring
Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.orgยท1dยท
Discuss: Hacker News
โšกLZ4 Streaming
LangChain, LangGraph, and LangSmith: Untangling the Confusion
dev.toยท8hยท
Discuss: DEV
โœจEffect Handlers
Weighted random generation in Python (2010)
eli.thegreenplace.netยท11hยท
Discuss: Hacker News
๐Ÿ”ขBitwise Algorithms
Things to build with Google's new Nano Banana image editing and generation model
logankilpatrick.medium.comยท17hยท
Discuss: Hacker News
โšกHomebrew CPUs
๐Ÿ“ŠBeyond the Standard: Exploring Modern Python Visualization Tools
dev.toยท1dยท
Discuss: DEV
โŸทBidirectional Programming
Synaptic Shortcuts: Predicting Spike Timing for Ultra-Fast Pathfinding
dev.toยท1hยท
Discuss: DEV
๐Ÿ”ฒCellular Automata
The future of microoptimization
goldenstack.netยท2dยท
Discuss: Hacker News
๐ŸงฎCompute Optimization
Cognitive and Gestalt psychology in your code: SMVP pattern
github.comยท9hยท
Discuss: Hacker News
โœ…Format Verification
I built an LLM from Scratch in Rust (Just ndarray and rand)
reddit.comยท16hยท
Discuss: r/rust
๐Ÿฆ€Rust Borrowing
Creativity Benchmark: A benchmark for marketing creativity for LLM models
arxiv.orgยท4h
๐Ÿง Intelligence Compression
Review: SpikingBrain Technical Spiking Brain-Inspired Large Models
arxiviq.substack.comยท1dยท
Discuss: Substack
โšกCPU Microarchitecture